A Novel Algorithm for Mining Fuzzy High Utility Itemsets
نویسندگان
چکیده
Utility mining is to find the itemsets in a transaction database with high utility values like profits. Although a number of algorithms on high utility mining have been proposed, they did not reflect the fuzzy degree of quantity and profit level for mined high utility itemsets, which are essential for decision making in various applications like stock control and sales analysis. In this paper, we explore to apply fuzzy sets theory to the utility mining problem and propose a novel method, namely FHUI (Fuzzy High Utility Itemsets)-Mine, for mining fuzzy high utility itemsets. In addition to reflecting the fuzzy degree for quantity and profit regions of high utility itemsets, FHUI-Mine also provides a fuzzy threshold range that may include itemsets with profits slightly less than the designated threshold value. To prove the feasibility of FHUI-Mine, it was compared with the well-known Two-Phase algorithm through experimental evaluation. The results show that FHUI-Mine delivers higher mining capability since it can not only mine all high utility itemsets found by Two-Phase algorithm but also discover additional itemsets that are potentially high utility ones.
منابع مشابه
A New Algorithm for High Average-utility Itemset Mining
High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...
متن کاملA Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI
Classical frequent itemset mining identifies frequent itemsets in transaction databases using only frequency of item occurrences, without considering utility of items. In many real world situations, utility of itemsets are based upon user’s perspective such as cost, profit or revenue and are of significant importance. Utility mining considers using utility factors in data mining tasks. Utility-...
متن کاملData sanitization in association rule mining based on impact factor
Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...
متن کاملTemporal Fuzzy Utility Mining with Upper-Bound
Fuzzy utility mining reflects fuzzy degrees of quantities and profits for high utility itemsets. In generally, transaction time is also concerned, and not all products sold are always on the shelf. Thus, in this paper we present an effective framework, which considers the transaction period of each product from the first transaction it appears to the last transaction in the whole database for m...
متن کاملAn efficient algorithm for mining temporal high utility itemsets from data streams
Utility of an itemset is considered as the value of this itemset, and utility mining aims at identifying the itemsets with high utilities. The temporal high utility itemsets are the itemsets whose support is larger than a pre-specified threshold in current time window of the data stream. Discovery of temporal high utility itemsets is an important process for mining interesting patterns like ass...
متن کامل